Unit 5: Data Matching and Consolidating 


What is Proximity Matching? The ability to match records based on their proximity, instead of 
comparing string representation of data. Data Services provides several types: 


Geographical proximity 


Find duplicate records based on geographic proximity, using latitude and longitude 
information. Not driving distance, but Geographic distance calculated using the Haversine 
Distance algorithm. It uses WGS 84 (GPS) coordinates. 


Allows geographical search of objects... 


...-around a location in a radial Or ...find the closest location 


| Retail Location | ~~ 


Example: Find the store closest to the customer location 


Cust Closest 
Name Cust Address Geo Code Store # 


Margaret 1429 W Elizabeth St, (40.575874, - 
Roberts Fort Collins, CO 80522 105.101652) 1544 


Neil 942 California Ave, Salt (40.74043, - 
Nevue Lake City, UT,84115 111.935701) 4403 


Example: Find all stores in a radial range 
Figure 31: Geographical Proximity Matching 


Numeric proximity 





This method either finds duplicates based on numerical closeness of data based either on 
numbers or date. 


e Numeric proximity — Find duplicates based on numerical closeness of data. 


e Date proximity — Find duplicate based on date ranges. 
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